All-to-All, Low Latency, and High-Throughput Optical Interconnect Demonstrator for Scalable Supercomputing
نویسندگان
چکیده
I. Project Summary This project addresses five key elements for future high-performance computing: (1) high-throughput, (2) lowlatency, (3) energy-efficiency, (4) scalability, and (5) high-radix, all-to-all connectivity. Computing and data centers today are dissipating Megawatts of power, and scaling to an exascale system would require nearly a Gigawatt [1]. Amdahl’s law suggests that a system with balanced computation, memory, and communications performs best under most circumstances, but today’s computing systems are misbalanced by typically more than two orders of magnitude lacking communication bandwidths. Further, they are also lacking high-radix and all-to-all interconnect capabilities that can benefit many important applications. Future exascale high-performance computing and data systems are limited in its scalability and performance due to interconnects, and transformative and new interconnect technologies are essential. This project demonstrates a new hardware and software solutions to interconnects for future high-performance computing. It demonstrates a new computing system with wavelength routed optical interconnects that (1) scale to ~ million compute nodes and Exabyte/s bisection-bandwidth, (2) support all-to-all connectivity without contention or arbitration with the radix count 512 and beyond, (3) exploit ~ pJ/bit communication energy efficiency, (4) achieve normalized throughput of ~100% in Racks, ~97% in Clusters, and ~80% in Full Systems 100% at 100% data injection rate on every transmitter port in the system, (5) validate the performance through an actual hardware system demonstration and a clock-cycle accurate simulator, and (6) demonstrate a chip-scale implementation using silicon photonics.
منابع مشابه
An Optical Packet - Switched Interconnect for Supercomputer Applications ∗
We describe a low-latency, high-throughput scalable optical interconnect switch for high-performance computer systems that features a broadcast-and-select architecture based on wavelengthand space-division multiplexing. Its electronic control architecture is optimized for low latency and high utilization. Our demonstration system will support 64 nodes with a line rate of 40 Gb/s per node and op...
متن کاملSystem simulation methodology of optical interconnects for high- performance computing systems
The relentless quest for processing speed in the range of teraflops and beyond has accelerated the need for scalable, parallel, high-performance computing (HPC) systems. To meet this high bandwidth and low power requirements, optical interconnect-based system architectures are being implemented by the HPC industry. While computer-aided design tools have significantly assisted electronic system ...
متن کاملSense and nonsense of logic-level optical interconnect: reflections on an experiment
Centimeter-range high-density optical interconnect between chips is coming into reach with current optical interconnect technology. Many theoretical studies have identified several good reasons why to use such types of interconnect as a replacement of various layers of the traditional electronic interconnect hierarchy. However, the true feasibility and usefulness of optical interconnects can on...
متن کاملA Class of Highly Scalable Optical Crossbar-Connected Interconnection Networks (SOCNs) for Parallel Computing Systems
ÐA class of highly scalable interconnect topologies called the Scalable Optical Crossbar-Connected Interconnection Networks (SOCNs) is proposed. This proposed class of networks combines the use of tunable Vertical Cavity Surface Emitting Lasers (VCSEL's), Wavelength Division Multiplexing (WDM) and a scalable, hierarchical network architecture to implement large-scale optical crossbar based netw...
متن کاملScalable, High-Throughput, Low-Latency AWGR-based Optical Switches with Distributed Control Plane for Future Computing Systems
We summarize our research on LION switches exploiting wavelength routing in arrayed waveguide grating routers (AWGRs). We discuss the loopback-buffer, alloptical-NACK and all-optical-TOKEN architectures, presenting their effectiveness in terms of network performance and experimental studies. OCIS codes: (200.4650) Optical Interconnects; (200.6715) Switching.
متن کامل